Generation Of Noun Compounds In Hebrew: Can Syntactic Knowledge Be Fully Encapsulated?
نویسندگان
چکیده
Hebrew •includes a very productive noun-compounding construction called smixut. Because smixut is marked morphologically and is restricted by many syntactic constraints, it has been the focus of many descriptive studies in Hebrew grammar. We present the treatment of smixut in HUGG, a FUF-based syntactic realization system capable of producing complex noun phrases in Hebrew. We contrast the treatment of smixut with noun-compounding in English and illustrate the potential for paraphrasing it introduces. We Specifically address the issue of determining when a smixut construction can be generated as opposed to other semantically equivalent constructs. We investigate several competing hypotheses smixut is lexically, semantically and/or pragmatically determined. For each hy: pothesis, we explain why the decision to produce a smixut construction cannot be reduced to a computation Over features produced by an outside module that Would not need to know about the smixut phenomenon. W e conclude that smixut provides yet another theoretical example where the interface that a syntactic realization component presents to the other components of a generation architecture cannot be made as isolated as we would hope. While the syntactic constraints on smixut are encapsulated within HUGG, the input Specification language to HUGG must contain a feature that specifies that smixut is requested if possible. • However, because smixut accounts for close to half the cases of NP modifiers observed on a corpus of complex NPs, and it •appears to be the unmarked realization form for some frequent semantic relations, we empirically evaluate a default setting strategy for the feature use-smixut based on a simple semantic Classification of the relations head-modifier in the NP. This study provides a Solid ground for the definition of a small set of predicates in the input specification language to HUGG, that has applications beyond the selection of smixut for the determination of the order of modifiers in the NP and the use of stacking vs. conjunction a n d for the definition of a bilingual input specification language. 1 I n t r o d u c t i o n Over the past three years,• we have s ta r ted developing HUGG, a syntact ic realization component for Hebrew. One of our object ives is t o investigate constraints on the design of the input specification language to a syntact ic realization component through a contrast ive analysis of t he requirements of English and Hebrew. By design, we are a t t empt ing to keep the input to H U G G as similar as possible t o the one we defined in the S U R G E syntact ic realization for English [7]. A detai led
منابع مشابه
A Lexical Resource of Hebrew Verb-Noun Multi-Word Expressions
A verb-noun Multi-Word Expression (MWE) is a combination of a verb and a noun with or without other words, in which the combination has a meaning different from the meaning of the words considered separately. In this paper, we present a new lexical resource of Hebrew Verb-Noun MWEs (VN-MWEs). The VN-MWEs of this resource were manually collected and annotated from five different web resources. I...
متن کاملDefiniteness in the Hebrew Noun Phrase
This paper suggests an analysis of Modern Hebrew noun phrases in the framework of HPSG. It focuses on the peculiar properties of the definite article, including the requirement for definiteness agreement among various elements in the noun phrase, definiteness inheritance in constructstate nominals, the fact that the article does not combine with constructs and the similarities between construct...
متن کاملDeeniteness in the Hebrew Noun Phrase
This paper suggests an analysis of Modern Hebrew noun phrases in the framework of HPSG. It focuses on the peculiar properties of the deenite article, including the requirement for deeniteness agreement among various elements in the noun phrase, deeniteness inheritance in construct-state nominals, the fact that the article does not combine with constructs and the similarities between construct-s...
متن کاملDisambiguating Noun Compounds With Latent Semantic Indexing
Technical terms in text often appear as noun compounds, a frequently occurring yet highly ambiguous construction whose interpretation relies on extra-syntactic information. Several statistical methods for disambiguating compounds have been reported in the literature, often with quite impressive results. However, a striking feature of all these approaches is that they rely on the existence of pr...
متن کاملNoun Phrase Chunking in Hebrew: Influence of Lexical and Morphological Features
We present a method for Noun Phrase chunking in Hebrew. We show that the traditional definition of base-NPs as nonrecursive noun phrases does not apply in Hebrew, and propose an alternative definition of Simple NPs. We review syntactic properties of Hebrew related to noun phrases, which indicate that the task of Hebrew SimpleNP chunking is harder than base-NP chunking in English. As a confirmat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998